A Flexible Approach for a Fault-Tolerant Router
Authors
Abstract
Cluster systems are gaining importance as a platform for parallel computing. In this area, the power of the system is strongly coupled with the performance of the network, which has to provide high bandwidth and low latency. Besides these performance aspects, fault tolerance within the network is very important. This paper shows how to build a flexible and fault-tolerant router, the main building block of a network. In addition, the overhead of executing fault-tolerant routing algorithms is examined.
Similar resources
CAFT: Cost-aware and Fault-tolerant routing algorithm in 2D mesh Network-on-Chip
The increasing complexity of chips and the need to integrate more components into a single chip have made network-on-chip an important infrastructure for on-chip communication and a good alternative to traditional bus-based approaches. As chip density increases, the probability of failures in the on-chip network increases, and providing correction and fault tol...
Design of an Active Approach for Detection, Estimation and Short-Circuit Stator Fault Tolerant Control in Induction Motors
Three-phase induction motors have many applications in industry. Consequently, detecting and estimating a fault and compensating for it so that the faulty induction motor still satisfies the predefined goals are important issues. One of the most common faults in induction motors is a short circuit of the stator winding. In this paper, an active fault-tolerant control system is designed and pres...
An edge router based protocol for fault tolerant handling of advance reservations
To provide quality-of-service guarantees, resource reservation schemes have to maintain reservation state at the nodes along the path of a flow. Advance reservation schemes have to maintain this state for a long period of time. The loss of reservation state caused by node failures makes advance reservation schemes highly susceptible to such failures. In this paper, we argue that a domain-by...
Cost-Efficient Fault-Tolerant Router Design for 2D-Mesh Based Chip Multiprocessor Systems
In this paper, a cost-efficient fault-tolerant router design, called the 20-Path Router (20PR) architecture, is proposed to reduce the impact of faulty routers in 2D-mesh based chip multiprocessor systems. The 20PR consists of two fault-tolerant circuits: 1) a Built-in Self-Test and Self-Diagnosis (BIST/SD) circuit to detect and locate faulty FIFOs and MUXs, and 2) a Fault-Isolation (FI) circuit t...
Fault-Tolerant Communication with Partitioned Dimension-Order Routers with Complex Faults
The current fault-tolerant routing methods require extensive changes to practical routers such as the Cray T3D's dimension-order router to handle faults. In this paper, we propose methods to handle faults in multicomputers with dimension-order routers with simple changes to router structure and logic. Our techniques can be applied to current implementations in which the router is partitioned i...
Publication date: 1998